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A CLONE OF DOTBUE-STSANM) HNA VIRUS APPLIED TO AKUBOifl 
PRODUCTION, STUDY OF REERCVIRUS-LIKE FBAMESHIFCING AND 
PRODUCTION OF PROIEINS IN YEAST 



This is a continuation-in-part of the 
pending application Serial Number 07/311 , 217 filed 
April S, 1989, which is a continuation-in-part of 
the pending application Serial Number 07/169,486 
filed March 17, 1988. 

BACKGROUND OF THE INVENTION 
Field of the invention 

The present invention relates to a DNA 
segment that encodes a double-stranded RNA genome 
of a yeast virus. More particularly, the present 
invention is related to a cDNA clone of the genome 
of the L-A virus, or a unique portion thereof, and 
various applications of this clone, including a 
method for determining the effect of an agent on 
translational frameshifting directed by a viral 
nucleotide sequence* 
Background 

Viruses use a wide variety of strategies 
to replicate and differentially express the 
proteins they encode. Retroviruses, double- 
stranded RNA (dsRNA) viruses and (+) strand RNA 
viruses including picornaviruses and togaviruses, 
have in common their use of genomic (+) strand RNA 
as a template for replication and as a message for 
protein synthesis. DNA viruses and (-) strand RNA 
viruses use splicing of mRNA to produce, in 
different amounts, proteins sharing part of their 
sequence (Livingston et al., 1965, Replication of 
Papovaviruses . In Virology, B.N. Fields, ed.. New 
York: Raven Press, pp. 393-410; Horvitz, 1985, 
Adenoviruses and their replication. In Virology, 
B.N. Fields, ed. New York: Raven Press, pp. 433- 
476; Kingsbury, 1965, Orthomyxo- and 
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Paramyxoviruses and their replication. In 
Virology, B.N. Fields, ed.. New York: Raven 
Press, pp. 1157-1178) . However, splicing is 
unknown among (+) strand RNA viruses and dsRNA 
5 viruses* probably because the spliced RNA would be 
packaged and replicated leading to the 
accumulation of defective viruses. Retroviruses 
splice their genomic RNA to make the env protein, 
but the spliced RNA lacks the Psi sequence 
10 necessary for packaging (Watanabe et al., 1983, 
MojL Cell. Biol . 3:2241-2249; Mann et al., 1983, 
Cell 331153-159; Markowitz et al., 1988, J. Virol . 
.< 62:1120-1124).. In addition, retroviruses use 

ribosomal frameshiftlng to make a large amount of 
15 the gag protein and a small amount of the gag-pol 
fusion protein (Jacks et al., 1988, Cell 55 :447- 
458/ Jacks and Varnais, 1985, Science 230 :1237- 
1242; Kramer et al., 1986. HTLV-III gag protein 
is processed in yeast cells by the virus pol- 
20 protease. Science 231 :1580-1584). Murine 

leukemia virus (Various, 1988, Science 240 :1427- 
1435) and two a viruses (Strauss et al., 1983, 
Proc, Natl. Acad. Scl, PSA 80:5271-5275) use 
nonsense suppression in the same way. The rates 
25 of transcription and translation of different 

reovirus dsRNA segments vary over a range of more 
than 20-fold to give overall rates of expression 
that vary ov6: **00-f old (Joklik, 1981, Microbiol 
Rev. 45^483-501). 
30 L-A is a dsRNA virus of the yeast 

. Saccharomyces - cerevisiae . L-A replicates by a 
conservative Toechanism, with (+) strands made by 
transcription of dsRNA and (-) strands then made 
by copying the (+) strands to form dsRNA again 
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(sequential synthesis) (Newman et al., 1981, J. 
Virol 38:263-271; Sclafani et al. # 1984, Mol. 
Cell.- Biol . 4^1618-1626; Newman et al., 1986, The 
replication' of dsRNA. In Extrachromosomal Genetic 
5 Elements in Lower Eukaryotes, R.B. Wickner, A. 
Hinebusch, A., M. Lambowitz, I.C. Gunsalus, A« 
Hollaender, eds., New York: Plenum Press, pp. 
173-187; Fujimura et al. 1986, Proc. Natl. Acad. 
Scl. PSA 83:4433-4437) . (♦) single-stranded RHA 

10 (ssRNA) is packaged in a new coat to form new 
viral particles (Fujimura et al., 1986, supra ; 
Fujimura et al., 1987, Mol. Cell. Biol . 7:420- 
426) . Both (+) and (-) strands are synthesized 
within viral particles, and because the size of 

15 the heads appears to be designed to contain one L- 

A molecule (4.6 kb) , dsRNAs less than half the 

« 

size of L-A replicate within the head until they 
fill it with multiple copies. 

The 4.6 kb L-A dsRNA genome with a coding 

20 capacity of about 180 kDa encodes both the 80 kDa 
major coat protein (over 100 molecules per viral 
particle) and a 180 kDa minor viral protein (about 
1 molecule per particle) . The 180 kDa protein 
shares immunodeterminants with the major coat 

25 protein and, unlike the major coat protein, has 

ssRNA-blnding activity which could play a role in 
both packaging and replication. L-A carries 
several genetic activities, called [HOK] , [NEX] t 

> 

[EXL] and (B] defined by the interactions of L-A 
30 with M dsRNA (see review by Nickner, 1986, Ann. 
% Rev. Biochem . 55:373-395; Qemura et al., 1988, 

Mol. Cell. Biol . Q: 938-944) . M t is a da RHA that 
encodes a secreted toxin and immunity to that 
toxin (Bussey, 1988, Yeast 4:17-26) and uses the 
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same particles as L-A for replication. Although 
both L-A and M* are affected by a variety of 
chromosomal genes, only three of the genes 
required for the maintenance of ^ (MAK3, MAK10, 
5 and PET18) are necessary for the maintenance of L— 
A (see review by Wickner, 1986/ supra ) . A 
deletion mutant of L-A, called X dsRNA, requires 
most, if not all, of the MAK genes (Esteban et 
al., 1988, J. Virol / 62:1278-1285) . 

10 In order to determine how both the 80 kOa 

and 180 kDa viral proteins can be encoded by the 
4.6 kb L-A molecule and to study their 
differential expression, structure, function, and 
relation to the genetic activities of L-A, it was 

15 desirable to clone and sequence the L-A genome, 
such a clone not previously being available. 

SUMMARY OF THE INVENTION 

- It is, therefore, an object of the 
present invention to provide a substantially 
20 complete cDNA clone of the L-A viral genome of 25. 
cerevlslae . 

It is another object of the present 
invention to provide a yeast test system for 
identifying factors influencing or controlling the 
25 -1 riboaomal f rameshif ting event that occurs 
during gag-pol fusion protein synthesis in 
retroviruses including HIV. 

. It is a further object of the present 
invention to provide a new yeast-based vector for 
30 preparing paxticles-immobilized antigens for 

inducing an immune response in a responsive host. 

It is yet another object of the present 
invention to provide a stable, recombinant cDNA 
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clone of L-A at a copy number of at least about 
10,000 per cell to produce a desired protein in 
large, amounts* 

Other objects and advantages of the 
5 present invention will become evident from the 
following detailed description of the invention* 

BRIEF DESCRIPTION OF THE DRAWINGS 
These and other objects, features and 
many of the attendant advantages of the invention 

10 will be better understood upon a reading of the 

following detailed description when considered in 
connection with the accompanying drawings wherein: 

Figure 1 shows a summary of the overall 
structure of the L-A genome* The two open reading 

15 frames, ORF1 and 0RF2 encode the two products of 
the L-A genome, the major coat protein of the 
viral particles and the chimeric protein having a 
major coat protein domain and a single-stranded 
RKA binding -RNA polymerase domain. 

20 Figure 2 shows the complete sequence of 

the L-A genome and specific features thereof, 
including possible frameshifting sequences near 
base 2000. 

Figure 3 shows the similarity between the 
25 sequences of HIV-1 and RSV that are known to 

produce ribosomal frameshifting and sequences in 
L-A in the 130 base region of overlap of ORF1 and 
ORF2. 

Figure 4 shows the homology between L-A's 
30 ORFs and the RHA-dependent RNA polymerase genes of 
a number of RNA viruses that allowed the 
identification of the 180 kDa protein as the L-A 
RKA polymerase. 
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Figure 5 shows the schematic design of a 
frameshift vector based on the frameshift 
sequences of the L-A virus. 

Figure 6 shows constructs useful for 
5 producing particles carrying multiple copies of a 
protein antigen on its surface for inducing an 
immune response* 

Figure 7, shows the use of a vector to 
support the replication and increase the copy 

10 hummer of the X dsRNA-based expression vector. 

« 

Figure 8 shows the structure of the L-A 
mRNA in the frameshift region that is involved in 
the "simultaneous slippage" mechanism, including 
the "slippery sequence"/ stem-loop and 
15 "pseudoknot" elements, and the use of these 
elements in combination with the beta- 
•' galactosidase 'indicator gene to provide a system 
for detection of frameshifting efficiency. 

Figure 9 shows the sequences of three 
20 vectors for frameshift detection, pF7, pF8 and 
pF9. pF9* is the 0 frame vector having two stop 
codons before the beta-galactosidase gene in pF9 
to provide a control for the level of a downstream 
start producing beta-galactosidase. pF8 is the -1 

« 

25 " frame vector which detects the level of 

frameshifting into the -1 frame. pF7 is the +1 
vector which detects shifting into the +1 frame. 
pT126 is a. 0 frame (in-frame) vector with no stop 
codons which measures the amount of beta- 

- 

30 galactosidase made from the promoter-translational 
start site combination of these vectors. 

Figure 10 shows the minimal sequence of 

mm • 

the vector for the assay of frameshifting in the 
. region from the PGK1 OAS -promoter through the 
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artificial translation start site, the shifting 
sequence derived from L-A and the beta- 

* 

galactosidase indicator gene. The PGK1 OAS- 
promoter was selected for reasons of convenience, 
5 but it will be recognized by one skilled in the 
art that any other yeast promoter, such as the 
GAL1 or PYK1 promoters, for example, can be used 
to provide transcription of the frameshifting 
region and indicator gene. The vector also 

10 includes the TRP1 gene and the 2 micron DNA 

replicon as well as sequences from pBR322 (not 
shown) which are used according to methods well 
known in the art for providing replication 
functions for the vector in bacterial and yeast 

15 host cells. 

* 

* 

DETAILED DESCRIPTION OF THE INVENTION 
The above and various other objects and 
advantages of the present invention are achieved 
by a cDNA clone of the L-A virus genome of S. 
20 cerevlsiae . 

Unless defined otherwise, all technical 
and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary 
skill in the art to which this invention belongs. 
25 Although any methods and materials similar or 

equivalent to those described herein can be used 
in the practice or testing of the present 
invention, the preferred methods and materials are 
now described. Unless mentioned otherwise, the 
30 techniques employed herein are standard 

methodologies well known to one of ordinary skill 
in the art. 
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The term "substantially complete" cDHA 
clone, as used herein, means that the clone has 
the complete nucleotide sequence of the L-A viral 
genome except for a few bases at each end which, 
5 however, are easily repaired by using 
oligonucleotide mutagenesis. 

The term, "modified" cDKA as used herein 
means (1) modified to include the sequence 
encoding the protein whose production is desired; 

10 (2) modified to place the appropriate region with 
this encoding sequence downstream from a suitable 
promoter and upstream from a suitable 
transcription terminator; (3) modified In a 
suitable region to optimize the amount of 

15 transcript production by L-A (with the protein 
coding sequeqce) ; (4) modified to optimize the 
level of translation of the mRHA produced by L-A; 
(51 modified to result in secretion of the 

* 

synthesized protein; (6) modified to include a 
20 multiple cloning site in an appropriate region, 
and the like. 

One skilled in the art will understand 
that the term "unique" with respect to an amino 
acid sequence means a sequence consisting of about 
25 four to six $mino acids which is not found in 
another protein. Similarly; the term "unique" 
with respect to a nucleotide sequence means a 
sequence consisting of about 12 to 18 nucleotides 
which is not found in another polynucleotide. 

30 MATERIALS AND METHODS 

Materials 

M-MLV reverse transcriptase, RHase H, and 
DKA polymerase I were obtained as a kit from 



WO 90/12488 - 9 - PCT/US90/0I717 

Bethesda Research Laboratories. Exonuclease III 
and SI nuclease were from Promega Biotec. 
Modified T7 DMA polymerase (Sequence) and T4 
polynucleotide kinase were from United States 
5 Biochemical. Other enzymes and substrates were 
obtained from Bethesda Research Laboratories, New 
England Biolabs, New England Muclear, LKB- 
Pharmacia or Boehringer Mannheim* 

Saccharomyces cerevlslae strain RE59 (a 
10 argl-1 ski2-2 L-A-HNB [D] L-BC W) was used as a 

source of VLPs. This strain contains a variant of 
L-A, L-A-HNB, which results in high levels of L-A 
viral particles (Uemura et al., 1988, Mol, Cell. 
Biol . £:938-944). This strain has no M x killer 
15 dsRNA, which, if present, would strongly repress 
L-A replication, and the chromosomal ski2~2 
mutation also contributes to the higher yield of 
VLPs. Escherichia coli strains, DH5a(F~, endAl, 
hsdR17 (r~,m 4 > .supE44, thi-1, recAl, gyrA96, 
20 relAl, A(argF-lacZYA)Ul69, $801acZAM15) and 

DRSctF' , were from Bethesda Research Laboratories. 
Plasmids SK*, SfT, and M13 helper phage, R406, were 
from Stratagene. 
Construction of cDNA Clone 
25 L-A cDMA was synthesized using (+) strand 

message of L-A, synthesized in vitro, as a 
template. L-A VLPs were prepared from strain RE59 
as described by Fujimura et al., 1986, Proc. Natl. 
Acad. Sci. PSA 83:4433-4437, except that the 
30 concentration of NaCl in the extraction buffer 
(buffer A) was raised to 500 mM to stabilize 
particles, and CsCl density gradient 
centrifugation was performed twice. L-A (+) 
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strand RNA was synthesized from the purified VLPs 
as described by Welsh et al., 1980, Nucleic Acids 
Res. 8:2349-2363. To obtain the highest yield of 
the product per template in a 200 ul reaction 
5 volume, each substrate (NT?) concentration was 
raised to 1.5 mMhnd the MgCl, concentration was 
adjusted to 10 mM. After 90 minutes incubation at 
30°C, the reaction was stopped by phenol, and then 
standard phenol-chloroform extraction followed* 

10 After etnanol precipitation, the mixture of L-A 
ssRNA and dsRNA was used directly as a template 
for the CDHA synthesis without further 
manipulation. The presence of ds RNA interfered 
neither with cDNA synthesis nor with the ligation 

15 reaction.' 

x. Two synthetic primers, 
GCATATGGGTAATTCCCATTATCTTTTTGGC {PRIMER I) and 
GAAAAATTTTTAAATTCATATAACTCCCC (PRIMER II), were 

used to synthesize L-A cDNA. These primer 
20 sequences were based on the 3' and 5' terminal 

sequences of L-A dsKNA (Thiele et al., 1984, Mol. 
Cell. Biol . 4*92-100). The 5' -termini of these 
primers were phosphorylated by T4 polynucleotide 
kinase. ' The first strand was synthesized with M- 

■ . * 

2$ KLV reverse^tabanscriptase following the 

manufacturer' s protocol^ except that oligo dT was 
replaced by the primer t. After the first strand 
*- syntheses'*, the primer II was included in the 

reaction mlxturei annealed, and the second strand 

-j • 

30 synthesis w^s performed using DMA polymerase I 

* ■ 

. with RNaseH. ... This use of the second primer 

* • • < . * '* 

allowed us to generate, cDHA having an intact 5'- 

.... ■ ■ . ■ 

* * « 

<ejnd. No labeled nucleotide was used at any point 
in the reaction process. The reaction was 
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terminated by a phenol extraction and the full 
size cDNA which runs at the same position as dsRNA 
was purified from a preparative agarose gel using 
DEAE-cellulose paper. The size fractionated cDNA 

5 was ligated into the Smal site of the multiple 
cloning site of a Bluescript vector, SK*. The 
ligated DHA mixture was transformed into frozen 
competent cells of £• coli strain DHSa prepared by 
the method of Hanahan, 1983, J. Mol. Biol . 

10 166:557-50. Transformants were selected on X-gal 
ampicillin plates and white and pale blue colonies 
were screened for their plasmid DNA inserts* 

Initially 108 transformants were screened 
for their insert DNA size, by digesting mini-prep 

15 DNA with flindlll and BamHI. Among them, 39 clones 
were close to the full size. These clones were 
further tested by digesting the plasmids by EcoRI, 
EcoRV, and BamHl, and comparing the size of the 
fragments derived from the 5' and 3' ends of the 

20 L-A cOHA. The digested fragments were transferred 
from an agarose gel to two pieces of 
nitrocellulose paper and hybridized with "P end- 
labeled primers I and IX, respectively. Among 
those which showed strong hybridization to both 

25 primers were L03, LOS, and L28. 

DNA Sequencing of L-A cDNA Clone 

The plasmids, L03 and L05, have L-A cDHA 
inserts whose orientation is such that the 5' end 
of the L-A <+) strand sequence is next to the 
30 universal primer site in the SK* vector, whereas 
the insert in L28 has the opposite orientation. 
These plasmids were digested at the Hindi 1 1 and 
Apal sites in the multiple cloning site of the SK* 
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vector, and a series of deletions were generated 
using exoIII and SI nuclease (Henikoff, 1987, 
Methods Enzymol . 155 :156-165) . ssDNAs were 
prepared following the protocol supplied by 
5 Stratagene with the following modifications. 

Fresh colonies of strain DH5aF' carrying a plasmid 
were inoculated into 2 ml of 2xxT medium in a 50 
ml disposable plastic tube, cultured at 37*C for 2 
hours, and Ml 3 helper phage, R408, was infected at 

10 an moi of about 10. After vigorous shaking for 
two hours at 37°C, the temperature was dropped to 
32°C, and the shaking was continued overnight* 
Although the reduction of the temperature is not 
essential, this method gave a consistent yield of 

15 rescued ssDNAs in a variety of deletion mutants 

constructed. Since F-pili are dissociated at the 
reduced temperature (Messing, 1983, Methods 
Enzymol . 101:20-78), this modification prevented 
reinfection by the helper phages which could 

20 eventually lyse the cells, especially those cells 
which have lost the plasmid during the overnight 
cultivation. 

For each plasmid, about 50 deletions were 
sequenced using Sequenase following the 

25 manufacturer's protocol. The average reading was 
400 bp. For clones in the SK* vector, the M13 -20 
17 mer was used as a primer. Also, BauBl/Hlndlll 
fragments from 6 cDNA clones were recloned into 
the SK"- vector, and the opposite ends were 

30 sequenced using a reverse sequencing primer. The 
DNA sequences were assembled and analyzed using 
programs of UWGCG (Devereux et al. # 1984, Nucleic 

^^^^^^^^^^^^^^^^^^^^^^ 

Acids Res , 12:387-395), IDEAS, ST ADEN, and PIR 
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included in the "Analysis" library at the National 
Cancer Institute Computer Center at Frederick, 
Maryland. Homology was also examined using the 
FASTA (Pearson et al., 1988, Proc. Natl. Acad, Scl 
5 PSA 85:2444-2448) and FPAT programs provided by 
David Lipuan, Chuck Buckler and William Pearson. 

A deposit of the cDNA clone of the L-A 
dsRNA has been made at the American Type Culture 
Collection, Rockville, Maryland, on February 10, 

10 1989 under accession number 67,888. The deposit 
shall be viably maintained, replacing if it 
becomes non-viable, for the life of the patent or 
a period of 30 years from the date of the deposit, 
or for 5 years from the last date of request for a 

15 sample of the deposit, whichever is longer, and 
made available to the public without restriction 
in accordance with the provisions of the law. The 
Commissioner of Patents and Trademarks, upon 
request, shall have access to the deposit. 

20 Among various utilities of the clone of 

the present invention, at least the following 
shall be noted. 

1. To identify factors influencing the 
ribosomal frameshif ting that retroviruses, 

25 including HIV, use to make the gag-pol fusion 
protein which is vital to viral replication. 

2. To prepare particles that have a 
particular antigen protruding from multiple sites 
on the particle surface for use in immunization or 

30 in biological and immunological tests. 

3. To provide an RNA expression vector 
and a helper for such an RNA expression vector. 
These utilities are now described. 
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Use of cDNA clones of L-A in examining factors 
affectingr ribosomal f rameshlftinq . 

A study of the DMA sequence of the cDHA 
clone of the present invention reveals that the 
5 4579 bp L-A has two open reading frames (ORFs) . 
ORF1 of 660 amino acids extends from base 30 to 
base 2072 of the L-A(+) strand and encodes the 
major coat protein.. A second ORF of 868 amino 
acids (ORF2) extends from base 1940 to 4546 and 
10 encodes part of the ssRNA-binding protein. The 
other part of this 180 kDa ssRHA-binding protein 
is encoded by. 0RF1. Fusion of ORF1 and 0RF2, 

* 

apparently by a -1 translational frameshift, 
produces the complete 180 kDa protein. A site 
15 ' r sjjailar to the site of f rameshif ting in HIV, RSV 
and othfcr retroviruses is found within the 130 bp 

r 

. overlap of OKF1 and ORF2 (Fig. 3) . ORF2 also 
contains a sequence characteristic of the RNA- 
dependent HNA polymerases of several picorna- and 

20 togaviruses (Fig. 4) . 

Retroviruses, including HIV, use 
ribosomal frameshifting to make a large amount of 
the gag protein (the major coat protein of the 
viral cores), and a small amount of the gag-pol 

25 fusion protean. The gag-pol ; fusion protein 

contains a number of domains, such as a domain 

• * 

that is identical to the gag protein itself, a 
domain that .has the reverse transcriptase 

■ % activity, and; other protease, integrase and RHase 

• • i*. 

30 . H domains, this fusion protein is essential for 

.•-*-. . ■■ 

the retroviral replication cycle and, therefore, 
interference with the process of ribosomal 
frameshifting, either by decreasing its efficiency 
or increasing its efficiency, would be expected to 
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block the retroviral replication cycle. At 
present, -1 ribosomal frameshifting is not known 
to be. used by eukaryotic cells for their own 
purposes. Most known cases are in retroviruses , 
5 while one is in a coronavirus, one in the L-A 
double-stranded RNA virus of yeast (described 
herein) and several suspected cases in (♦) strand 
RHA viruses. This means that, based on evidence 
to date, a drug specifically affecting ribosomal 
10 frameshifting would be effective against various 
viral infections, mostly retroviruses, but would 
not adversely affect the host. In order to detect 
and develop antiviral agents which act by 
influencing frameshifting, it is necessary to have 
15 a simple system in which the effect of various 

drugs, host factors or other perturbations on the 
efficiency of ribosomal ' frameshifting can be 
easily tested and determined. 

The retroviruses must 'make the host 
20 ribosomes carry out frameshifting from the 

upstream gag reading frame to the downstream pol 
reading frame. This must be carried out while the 
ribosomes are in the rather small region (201 bp 
in the case of HIV) in which the two ORFs overlap. 
25 In all known cases, mammalian retroviruses carry 
out a shift of -1 base in the overlap region. 

As mentioned above, ORF1 of L-A encodes 
the major coat protein of the viral particles in 
which L-A is found in the cell (like gag of 
30 retroviruses) • ORF2 encodes a part of the 160 kDa 
protein with homology to RNA-dependent RHA • 
polymerases that has single-stranded RHA binding 
activity. Like the gag protein of retroviruses, 
the L-A major coat protein encoded by 0RF1 is made 
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in large amounts. Like the gag-pol fusion protein 
of retroviruses, the ORF1-ORF2 fusion protein (the 
180 kQa protein) is made in small amounts. The 
mechanism appears to be same. The shift from 0RF1 
5 to OBF2 must occur in the 130 bp region of overlap 
between ORF1 and ORF2. The exact site of the 
shift in HIV, RSV and in several other 
retroviruses has been precisely determined and has 
been shown to occur at the sites shown in Fig. 3 

10 -for HIV and for RSV. The essential elements of 

this site are the sequences, 5' UUUUUUA 3' for HIV 
or 5 r AAAUUOA 3' for RSV, followed closely by an 
inverted' repeat sequence that most likely 
functions to slow down the ribosomes. As shown in 

15 Fig. 3, a similar sequence was found in L-A in the 
130 bp overlap region. The reading frame in this 
homologous region is the same as that in HIV where 
it frameshifts. The ORF1 frame in this GGGUOTJA 
sequence is . such that a -1 base "simultaneous 

20 slippage" (Jacks et al., 1988, Nature 331:280-283; 
Jacks et al., 1988, Cell , 55:447-458) of peptidyl 
tRNA flr bound to the GGU and leucyl tRNA*"* bound to 
the OQA leaves the non-wobble bases of each tRNA 
anticodon still paired, to the mRNA. This 

25 mechanism has been demonstrated by Jacks et al., 
supra . , for RSV, and is strongly suggested for 
other retroviruses which frameshift. The L-A 
double-stranded RNA virus makes its gag-pol fusion 
protein by the same mechanism (see Fig. 8) as that 

30 used by retroviruses, namely, a -1 ribosomal 

frameshifting event by the "simultaneous slippage" 

* 

mechanism described for Rous Sarcoma Virus (Jacks 
and Various, 1985, Science 230 ; 1237-1242: Jacks and 
Varmus, 1988, Hature 331:280-283) . The general 
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structure and sequence pattern at the site of the 
frameshift is the same as that used by 
retroviruses, namely a short "slippery sequence" 
(GGGUUUAGC for L-A) upstream from a stem-loop 
5 structure which has sequences complementary to a 
downstream region such that a secondary structure 
known as a "pseudoknot" can be formed in the mRNA 
(Fig. 8). 

Since ribosomal frame shifting is an 

10 interaction of the ribosome and the mRNA, it is 

reasonable to use any message construct to examine 
its mechanism whether or not that construct comes 
from a retrovirus or not* At the time the 
ribosomes are frameshiftlng on HIV message, they 
* 15 have no way of knowing whether the RHA came from a 
retrovirus or not. The yeast system of the 
present invention provides easy genetic 
manipulability for host genes, viral components 
and clones. • 

20 In order to make a construct, the region 

at which the frameshiftlng occurs is placed 
upstream of an indicator gene for detection of 
frameshiftlng, for example, the beta-galactosidase 
( lacZ) gene of E. coli, which can be detected by 

25 an indicator dye, or a gene such as LEU2 or URA3 
or LYS2 or CAN1 for which cells having increased 
or decreased expression can be selected (Fig. 5) . 
For example, cells expressing the products of the 
LEU2, ORA3, or LYS2 genes, which are involved in 

30 intermediary metabolism, can be selected by 

omission .of a required nutrient from the culture 
medium. In addition, under other conditions, 
cells which are not expressing the same UEA3, LYS2 
or CAN1 genes can be selected by addition to the 
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medium of a metabolic inhibitor for which toxicity 
is mediated by the URA3 or LYS2 gene product. 

In construction of the vectors for 

frameshift detection, the reading frame is 

•a " 

5 adjusted so that only if f rameshif ting occurs will 
the downstream signal gene be expressed. One or 
more' of these constructs (Fig* 5) are introduced 
into yeast strains and mutants are isolated in 
. genes that affect f rameshifting efficiency by 
10 either increasing^ pr decreasing it. Drugs are 

' screened for their effects on frameshift ing using 
Ceils gbntaining these "frameshift ing vectors". A 

* • 

decrease in frame shifting would interfere with 

♦ 

. retroviral replication by interfering with the 
15 supply of reverse transcriptase* Monitoring the 
effects on more than one construct makes the 
. screening of effects peculiar to one signal gene a 
simple matter. Also, r pther control constructs are 
prepared in which the signal gene is in frame with 
20 the upstream region so that no frameshift is 
r K . rfeedetf for expression • Yet other control 

constiaictf have the signal gene in the +1 frame. 
: A set of vectors has been constructed 
j with the frameshift site of the L-A virus 
25 downstream of a yeast constitutive promotor and 
. upstream of the lacZ gene of £. coll in each of 
. the three possible reading frames (Figures 9 and 
16) . Stop codons are placed such that only a 
frameshift in the region of the overlap of the 
30 open reading frames can generate active beta* 
galactosidase. If 100% is the level of beta- 
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galactosidase produced from the vector with the 
lacZ gene in frame with the promotor, then the -1 
frame- vector produces 2% beta-galactosidase 
activity and the +1 frame vector produces only 
5 about 0.05% beta-galactosidase. Thus, this 
structure produces specifically -1 ribosomal 
frameshifting. It has been confirmed by 
subcloning experiments that this specific region 
(L-A bases 1950 to 2028 from the 5' end of the <+) 
10 strand) is actually responsible for the 

frameshifting. Based on the current understanding 
of the mechanism of frameshifting in the L-A virus 
and retroviruses, the minimal sequence required to 
direct translational frameshifting in this region 
15 begins at about the start of the slippery site 
(i.e., at the position numbered 1958) and ends 
. with the last base of the 3' pseudoknot region (at 
position 2024), inclusive, from the 5' end of the 
(+) RNA strand, as shown in Figure 10. 
20 This requirement for the secondary 

structure downstream from the L-A slippery site 
stands in contrast to a report about the sequence 
requirements for frameshifting in HIV in which 
Wilson et al., (1988. HIV expression strategies: 
25 ribosomal frameshifting is directed by a short 
sequence in both mammalian and yeast systems. 
Cell 55:1159-1169) teach that frameshifting in HIV 
is directed by the slippery site alone and is not 
dependent on downstream stem- loop structures. In 
30 the present system, it was found that deletion 
near the frameshifting region on the 5' side of 
the slippery site, up to position 1956, has no 
effect on frameshifting. Substitution of another 
base for one or two of the first three G's in the 
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slippery site (GGGUUUAGG) interferes with 
frameshifting, but frame shifting is restored if 
all three of these G's are replaced with the same 
base. On the 3' side, deletions up to the 3' end 
S of the 3' pseudoknot sequence have no effect, . 
whereas base substitutions in the 3' pseudoknot 
interfere with frameshifting. Furthermore, base 
changes in the 5 r pseudoknot sequence that render 
this sequence noncomplementary to the 3' 

10 pseudoknot sequence also interfere with 

frameshifting. Accordingly, it will be recognized 
by one of ordinary skill in the art of genetic 
engineering that certain substitutions of base 
sequences can be made in this L-A virus 

15 frameshifting region without affecting the 

frameshifting function and, therefore, without 
departing from the spirit and purview of this 
application and scope of the appended claims* 
Specifically, in addition to tlie changes cited 

20 above, changes in the stem-loop structure which 
preserve its. inherent complementarity, its 
approximate size, and the relative positions of 
the pseudoknot elements can be made without 
materially affecting the frameshifting function* 

25 The. vectors described above can now be 

« 

used for the rapid screening of drugs potentially 
affecting the efficiency of -1 ribosomal 

frameshifting using an embodiment of the invention 

•« 

comprising a microbial screening assay on 
30 * semisolid culture medium. Beta-galactosidase 

produced in vivo can be assayed on petri plates by 
incorporating X-gal {5-bromo-4-chloro-3-indolyl- 
beta-D-galactopyranoside) in the solid medium on 
which the cells carrying the vector are grown by 
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methods well known in the art (T.J. Silhavy, 
Berman ML, and Enquist LM; Experiments with gene 
fusions. Cold Spring Harbor Laboratory, Cold 
Spring, Harbor, NY, 1984) . The beta-galactosidase 
5 produced releases a blue dye from the X-gal, and 
the intensity of the color is an extremely 
sensitive indication of the efficiency of -1 
ribosomal f rameshifting. This color intensity is 
capable of reproducibly indicating differences in 

10 beta-galactosidase production as low as 2 fold, 
but it is also capable of indicating large 
difference as well* The low (2%) level of enzyme 
produced by the -1 ribosomal frameshifting vector 
results in a light blue color while the 100% 

15 vector (0 shift) gives very dark blue and the 
0.05% level of the +1 shift vector gives white 
colonies • 

These vectors have been used to isolate 
mutants in chromosomal genes affecting the 

20 efficiency of frameshifting. Starting with the 
light blue wild-type host carrying the -1 vector, 
white and dark blue mutants were picked. The 
mutants were assayed for beta-galactosidase 
directly and the white mutants always had reduced 

25 amounts while the dark blue mutants always had 

increased amounts of enzyme activity, confirming 
the accuracy of the plate assay. 

The identical vector in the same strain, 
or a strain with increased permeability to drugs, 

30 is used to screen drugs on plates. Drug solutions 
are applied directly to a newly seeded lawn of the 
vector-containing strain on an X-gal plate. The 
effect of the drug on the blue color development 
indicates the effect of the drug on -1 ribosomal 
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f rameshif ting , Gradient plates with an under 
layer of agar containing the drug could also be 
Used.- Potentially active drugs could also be 
tested for effects on cells carrying the 0 frame 
5 or +1 frame vectors or no vector, as controls for 
effects on other steps in the degradation of X- 
gal. Determination of effects on growth of the 
cells provides a simple first screen for drug 
toxicity. 

.10 Using such a test, thousands of drugs can 

be screened quickly and cheaply since solid medium 
for yeast ' growth is very inexpensive and the 
manipulations are simple. It is expected that an 
agent which showed either a substantial increase 

15 or decrease in ribosomal'f rameshif ting efficiency 
in the present system would adversely affect 
retroviral .reproduction and thus be useful as an 
. antiviral agent, ' 

-r 

Use of L-A clones to make antigen-carrying 

20 particles for' Immunization 

Particle-bound antigens have long been 
known to be more antigenic than soluble antigens 
(Adams et al., i987, Nature 329:68-70) . Using the 
process of Adams et al . , gene fusions or ORF1 

25' encoding the major coat protein of L-A are made 

with the gene of the protein antigen against which 

• • , 

immunisation is desired. The fusion is introduced 
into yeast and expressed. The particles formed by 

the major coat protein then have the protein 

* * » 

30 antigen attached at multiple locations on each of 

■ 

. the particles. . Expression at various levels in 

• * 

the presence or absence of endogenous L-A virus 

..«».. * 

• *• • . 

• " -can be used to produce particles having various 
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numbers of antigen side chains per particle. The 
DMA sequence for the protein antigen is introduced 
at various sites within the major coat protein 
gene to determine which sites give the most 
5 immunogenic particles. The particles can be 

purified easily using their* physical properties. 
The particles can then be inoculated to induce the 
desired immune response. 

Use of L-A cDHA clones to express and produce a 
10 desired protein and as a helper for an X cDNA- 

based RNA vector . 

In the replication cycle of the L-A 

virus, the viral (+) strands are an intermediate. 

Viral (+) strands are synthesized by viral 
15 particles that contain the viral double-stranded 

RNA genome and extruded from the viral particles. 

They then serve a dual role; 

(I) These (+) strands 'are the mRNA that 
is used by the ribosomes to make the two viral 

20 proteins needed for viral -packaging and 

replication, namely, the major coat protein (80 
kDa) and the major coat protein-RKA polymerase 
fusion protein <180kDa) • 

(II) . The viral plus strands are 
25 encapsidated by the newly synthesized coat 

proteins to form new virus particles. 

This natural viral replication cycle is 
exploited to synthesize abundant amounts of a 
desired protein. The L-A cDNA clone can also be 
30 modified to encode the desired protein instead of, 
or in addition to, the normally encoded proteins. 
The viral regions and signals necessary for the 
replication, transcription and packaging of the L- 
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A RNA molecule have been defined and delimited. 
These signals are completely included in the 5' 
most 25 bases of the (+) strand and the 3' most 
500 bases of the (+) strand (Esteban et al., 1988, 
5 Proc. Hatl. Acad. Scl. USA , 85:4411-4415). 

Replication requires the 3' terminal 30 bases and 

■ 

a region about 400 bases from the 3' end (the 
Viral Binding Sequence or the Internal Replication 
Enhancer) . In order to produce the proteins, the 

10 region coding for the viral proteins is replaced 
by the gene for the desired protein. The signals 
for transcription and packaging and replication 
are left intact and the desired protein is allowed 
to be expressed. 

15 Since strains having up to 20% of their 

total protein as major coat protein can be 
constructed, the transcription and translation 
signals normally used by the virus are quite 
efficient, although the method' allows one to 

20 modify and improve either. 

The L-A cDNA clone is altered as above to 
contain the desired gene and upstream of the 
modified L-A sequence is placed either a yeast 
promoter or a T7 RNA polymerase promoter. This 

25 vector is then introduced into a yeast strain that 
Carries a normal L-A virus to supply the 
replication and packaging proteins. Then, the 

■ 

transcription of the modified L-A is turned on. 
The (+) strands thus produced (because they have 
30 the packaging sequences) are packaged in virus 
particles replicated and transcribed. The (+) 
strand transcripts are translated to produce the 
. desired product * 
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Another use for the L-A cDNA clone in 
producing the desired protein is as a source of 
large amounts of replication proteins. The normal 
L-A produces the replicase (RNA-dependent RNA 
polymerase) as a fusion protein with the major 
coat protein at about 1% of the amount of coat 
protein alone. While this ratio may be optimal 
for maintenance and replication of L-A in a normal 
host strain, it is probably sub-optimal for 
maximal copy number and expression in the ski- 
mutant hosts that give up to 10-20% of host 
proteins as L-A major coat protein. The mechanism 
of formation of the fusion protein is ribosomal 
frameshifting. This is an inefficient process and 
can be simply bypassed by inserting one nucleotide 
in the region of overlap between the two open 
reading frames of the L-A cDNA clone. If this 
clone is modified so as not to have the packaging 
and replication signals, the transcripts will only 
serve as mRNA. Thus, a large amount of RNA- 
dependent RNA polymerase can be supplied to the 
replicating virus to increase its copy number and 
rate of transcription. 

The invention is now illustrated by the 
following example: 
Example 1 ; 

This example shows the construction of a 
' f rameshif t vector' using the frameshift region of 
the L-A viral sequence. The structure is 
schematically illustrated in Fig. 5. The yeast 
PYK 1 promoter is linked to an AUG start codon 
followed by the frameshift sequence from the L-A 
virus. The region of L-A analogous to HIV or RSV, 
shown in Fig. 5, is the exact region of L-A 
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responsible for frame shifting and was inserted 
during the construction. At the end of the L-A 
frameshift sequence is a termination codon, TAA, 
in the inshif ted t frame, so that ribosomes that 
5 have not shifted will cease translation and not 
have the opportunity to shift further downstream. 
The E. coll lacZ gene encoding beta-galactosidase, 
is placed next and is in the -1 frame relative to 

the AUG. This means that the ribosomes must shift 

* 

10 . back one. base on the mRHA in order to properly 
•' translate this region. Control vectors with beta- 
galactosidase in the original "0" frame or in the 

frame are also made, but are not shown in 
Fig. 5. The vector also carried replication 
15 origins and selectable markers for yeast and E. 
coll. The vectors are transformed into a wild- 
type yeast strain and the levels of beta- 
galactosidase in the "O" frame with the *-l" frame 

and the frame. The amount of beta- 

* • ■ ■• 

20 galactosidase in a colony can also be estimated 
using plates containing X-gal and the blue color 
that results from its breakdown by the enzyme. 
Cells carrying the m -l m vector (the construct 

■j, shown in Fig. 5) are grown on plates containing a 

25 . drug t& be tested for its effects on 

fraiftfrahjLfting. A change in the shade of blue can 
be used as/* simple qualitative screen for an 
increase or decrease of the degree of* 
frameartxtfting. . Thus, hundreds of drugs can be 

* • * 

30 . ' quickly screened for such an effect. More careful 
quantitative assays can then be carried out on 
. promising drugs using the simple beta- 
galactosidase assay of permeabilized whole cells. 

. ■ • 

Since the yeast translation apparatus closely 
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resembles that of animal cells, this method can be 
used to screen and determine the direct effects of 
drugs on the replication of retroviruses in tissue 
culture or in vivo tests. 
5 The same strain carrying the frameshift 

vector can be mutagenized and plated for single 
colonies. Those colonies that are more or less 
blue than the parent strain can then be analyzed 
to detect single-gene host cell mutations causing 
10 increased or decreased ribosomal frameshifting. 
The host genes involved can be cloned and 
characterized and their mammalian counterparts 
examined for similar effects on translation in 
mammalian cells. 

15 Example 2 : 

To produce particles vith regions of the 
HIV-1 p JI protein exposed on the outer surface, 
constructs such as are shown ih Fig. € are 
prepared. The L-A ORF1 encoding the major coat 
20 protein is fused in frame to the p" gene. [In 
other constructs not shown the p" sequence is 
inserted within the L-A ORF1 or at the N- 
terminus.il The expression vector carrying this 
construct is introduced into a yeast host. The 

25 major coat protein with the p" sequences attached 
forms viral particles which are then purified in 
large amounts by conventional methods. The L-A 
ORF1-OBF2 fusion protein is expressed separately 
from a separate vector to determine whether this 

30 fusion protein is, as has been hypothesized, 
necessary for particle formation (Fujimura & 
ffickner, 1988, Cell , 55:663-671). 
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The purified particles are used to induce 
an immune response or to generate antibody 
reagents. The particles are also used as the 
antigen in tests for determining the presence of 
5 antibody in serum of patients or animals. 

* 

Example 3 ; 

The L-A ORF1 and ORF2 are fused in an 
expression vector by simply inserting one base in 
the region of overlap between ORF1 and ORF2. This 
10 vector then expresses the viral RHA polymerase. 
Along with another expression vector expressing 
only the major coat protein, ORFs are used to 
supply needed products to the X dsRNA-based 

* 

expression vector (described in O.S. patent 

IS application Serial Number 07/169,486, filed March 
17, 1988) which has all cis sites necessary for 
replication, transcription and packaging. Using 
the cDNA clone of X, the gene 'for a protein whose 
production is desired can be inserted, introduced 

20 into yeast, regenerated as an RNA virus and 
expressed by the L-A encoded transcription 
apparatus. The cloning of the L-A genome makes 
possible supplying or modifying the protein 
components of the L-A packaging and transcription 

25 apparatus from these clones in such a way as to 
optimise expression from the modified X virus- 
vector. Tills is schematically shown in Fig. 7. 

It is understood that the examples and 
embodiments described herein are for illustrative 

30 purposes only and that various modifications or 
changes in light thereof will be suggested to 
persons skilled in the art and are to be included 
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within the spirit and purview of this application 

and scope of the appended claims. 

* * * 

For purposes of completing the background 
5 description and present disclosure, each of the 
published articles, patents and patent 
applications heretofore identified in this 
specification are hereby incorporated by reference 
into the specification. 
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WHAT IS CLAIMED IS : 

1. A recombinant DNA molecule 

■ 

comprising an indicator gene for detection of 
f rameshifting during translation and a DNA 
segment^ 

wherein said DNA segment encodes that 
portion of the double-stranded rna genome of 
the L-A "virus of yeast consisting essentially 
of a viral nucleotide sequence that directs 
translational f rameshifting between the 
- reacting frame for the major coat protein and 
• the reading frame for the RNA polymerase, and 
wherein said DNA segment is located 
upstream from said indicator gene so that said 
indicator gene is translated in the correct • 
reading f?ame only when frameshifting occurs 
in the messenger RNA sequence transcribed from 
said DNA segment. 

2. The recombinant DNA molecule 
according to claim 1 wherein said viral nucleotide 

- sequence includes bases numbered 1958 to 2024, 
. inclusive, shown in Figure 10 • 

3. The recombinant DNA molecule 
according to claim 1 wherein said indicator gene 
is selected from the group consisting of the beta- 
galactoaidase (lacZ) gene of Escherichia coll, and 
the following genes of Saccharomyces cerevisiaez 
LEG 2; URA3; LYS2 and CAN1. 
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4. The recombinant DNA molecule of 
claim 3, wherein said DNA molecule includes the 
entire nucleotide sequence shown in Figure 10. 

5. A method for determining the effect 
of an agent on translational frame shifting, said 
frameshifting being directed by a viral nucleotide 
sequence, comprising: 

producing the translation product of 
said indicator gene of said recombinant DMA of 
claim 1 in the absence of said agent; 

producing the translation product of 
said indicator gene of said recombinant DMA of 
% claim 1 in the presence of said agent; and 

comparing the 'amount of said 
translation product produced in the presence 
of said agent with said amount produced in the 
absence of said agent, wherein a difference in 
said amount in the presence of said agent is 
indicative of the effect of said agent on said 
frameshifting. 

6. The method of claim 5 wherein said 
recombinant DMA molecule is contained within a 
host cell. 

7. The method of claim 6 wherein said 
host cell is a yeast cell. 



8. The method of claim 7 wherein said 
translation product is beta-galactosidase. 
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9. The method of claim 8 wherein said 

host cell is cultured on a semi-solid medium and 
said amount of said beta-gal actosidase produced by 
said host cell is determined by the intensity of 
color produced by a dye that is released by 
hydrolysis of a substrate for said beta- 
galactosidase . 

10. The method of claim 7 wherein said 
translation product produced by the recombinant 
DNA molecule of claim 1 as a result of said 
frameshifting is a product for which cells having 
increased or decreased expression can be selected 

: by varying culture conditions. 

11. The method of claim 10 wherein said 
translation product is selected from the group 
consisting of the translation products of the 
following Saccharoaiyces cerevi$iae genes: LEU2; 
URA3; LYS2 and CAN1. 
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Figure 1. 
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401 CATCAACATCnCCCTCCTTACCACnACAACTArffiACCQUri ffllTtA I IAT*ttXCTCCA^I<TCa^ATCT<C^T^TOnUaTCTOUUATC 300 
K ■ » L » AT«l«*l.1f«0QT f fc_g OlfflllOfilVKI 

Peptide III* « « I 

SOI AACCTACCACTCACTATCCATCA C* fCflC* finrrC AACACAATTCCCTT4*SC1 ACT6ACACCT CTACTCATTCSI ATCCTCAnCQCCACACTTTTCTC 400 
KlPVtlOOTAOPtOrAVPfOtSIOSVPOUAOFSt 

*oi MTCAmcnTCAATCttOTCCCOTACCTAttTGTTA^^ TOO 
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* • HI (fiZ-Z**). - • • • 

1101 CCC6A6CfCCT6nCACnACfi0CAf6CA00TC1CA6TA0C0nAAACACC60ACT 1200 
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1801 CCACTATATAJCMACTCMWQCAQttTWCCCCATQ 1000 
Tt TOIIVOAOAIAVTIASt t I10YVKPVST6I 0 

• • • « • »...««t M< fr— ifclf t »«|Ljcnc«f 

1901 4XTCATCAAT6CO0CC4AArt T AAfiAACTACTOCQCIACTCT AATCCCT 20QO 

VfOAOElCKYVCSVftlTQQClCVVClTMPAVKP 

CASrSACFCSfitSYOASCtAY 

........ .... . • . . • • . 

2001 ACCGCA6AACCTACA0CT COCCCT CCCCACQtACACn&ATAfiAACACGCCaiCAATCTTTTA^ A6ACTAAACST AATCGAAfmr*ri>fC6ACCC06 2100 

TCCPTAOAAIECtlEOAIIVtVE* 
CtTYfVtCPttVOtTCCQCFftVIVICPSlSPt 

2101 CCCTAaUKC|AaTAtTOtltt*AJXIWA^ 2200 
rilYIlQEPOTTPAtfltFtltVOAVStOCATIF 

2201 ncniCACATCGlACClGCCGCAfiTAATTACT 2300 
t F • 1 V P A A V 1 SOFTTSDT SSFAYCSITYAVOVTA 

2301 CATTOUCTCAOTaCACmTCCOTCTAOCTACA^ 2400 
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2401 ACACGtCCCMGCTffTCnATMCAOtCCTACC6TTAT66ATTC6CTA60EAAf ATCT IGGACCTACAC0CCAA1 A1 ACCACCCAAACACt t CAACGC1 2500 

ovAcrcrirtfTviiofCAaitovotiiircaFCfi 

2301 iii<xgrT4T*raAAi^CTOC«r4rt«^r«Ajw^ 2600 
ItLTTISCVTAORHTIlRrOElVCAAACVSfttC 

2601 AATACTACnMTCTSICTACnCAfiCTCCTCSCCAACI TA£AA£TA6AtCTTCAAfiCACCA6TA6CTACTAf TCTCCCAf ATGTCCT OACACIAACTCA 2700 
rrtMCVVClLAHLQVOtCAAVAIIlAVVLTCSC 

2701 AAAATffCtAccurmcnGCAnaACACcmArcttT 2000 

C f V t I f t 0 t t&1UVCCtCrftAL1*tlKA$f t QI 

2001 AACAOCATAOLCACCCCTCAnACaACCACTCACTaU^ 2900 
KSIITAOVCPCTClFEtAVLMilCVCIVSWOACK 

2901 AttATCAtCOmaATCCCfilflCTCfinCTACTTCAIW 3000 
DllllPOVAVVOOAKlTSCVtOMftGSKQTTKY 

3001 TCCOTTAfGACOTCOttaCACTACACTQCAAAaW 3100 

rriiTifooTTANRVCvyrccsvisoTccoforir 

3101 CCIC0TCAfiTATA CT A66AACAAfinCATAACTCTTAACAAMT6C C CAAACACAAAAIA^ 3200 
PCOYTAICftTVIICKtClClCtfttAStrCVtAVT 

3201 CCTCCACSAACTAfXAATCOCCCAACOACCTOC^^ 3300 
ST£YfUS_J^Q£AXTCT0LtSfLtT«rAI(fACCOV 
• AAA polymerase 

3301 TnCACfCACAAmCCCACTACQCCACCAfifiCAO 3400 
LffKft VC0Q £EAArVIKKVIMMLe6ASSrCfi 

3401 IATW6ACnCAATTC1CACCAnCAATAIgTA6TAT6TAIAC6CTin 3S00 
YDOFKSCISIASNYTVlCAFIOTrSAIHSOEQAE 

3501 AGGCOATGAACTCCGTCTfiTCACTCCGTCACAT^ 3600 
AA«WVCt$VlA»IVVL0P0TtCUIILOiI L t J_J V 

3601 GCCCTTAACCACAmATOAACAaerOCTAAACTGCGCCTATAT 3700 
A L I T f « £_T VlgVAYNtlACVFtlOtVODSVIttfi 
MA polymerase • . . . • ♦ 

3701 WTOATPWAfCATTACTCTCAACCCCOTQAOCACAOCtf 1800 
VMISLKAVSTAVtl»9ANIIIIARAflF'ACC«l 



3001 TGTTTTCCATMCCOAAXnXTGAOOSTAfiAACACCOT^ 3900 
FStSEFLIVEi€ll|66»6l6AQ1lItSCAflVI 

3901 CACTACaTT6ACTCTa4 Cfi fcA £aa CTWTACTAC0AgnATC C M CC 4000 
SlICSICrilVVIVMCAOOAItlOLAMtTtVOS 

• «•••••«• I*— X dsABA— > 

4001 OCGGfAACAOCfiATAAAAOACAAaCOACAAACStCTCAaAACA 4100 
AVTAlCEOlOCAVTCtrcV60ftVVt91ITA«tVC 

4101 OTGOCGCTATCTCOACTGAfACCTGCGCACOXn 4200 
CCISTOTVAPVETCI IT0IEAYEt»YEIOO»SF 

4201 TTCCCCAOOCCTAAATCATI ATOCn A1 AAAfiY CTCCAAAAA1YY CG6A6AAC0ACT C6AATHM1 AAfiATTAAACATOCCSTAfiCT ACA6C0ACT ACS 4300 
V»CV«0YAYCVV*«fCEtLEFICfC0AVAIOIt 

4301 AOCACWACCTCIOAAAC CT AAASCTACW^ 4400 
SIIAtrtKAAtTSCKMEFAICSEVEtlflTCATKG 

4401 CTnOQCACTCTCAIACTATOCTAACCTOAg 4S00 
LAVtTTABltKFMttffKAllCFCtAATANOAA 

4501 CCncaAOTCTCATCCACTCCOO^nACAOCTCATAgCT 4S79 
lftStOft.ftAt.QVIl* < •frUer t 



WO 90/ 1 2488 PCT/US90/01717 
Figure 3- 



A/ll 



(CCA 
0 C 
A C 
€ - 0 
C- 6 

a - a 

L-A .e 5: *. * M ' ke " 

0- c 

s-c 

6 - C 
A - U 
d-A 
6 - C 

6 6CUA60 6V6C6 0C6 0 A C 0 C A 6 C AgTfiUJ^AGJAC f-6 6 A 6 A A C C U A C AG 
Nijor Cett frpt • QAM - - > S 1 o 6 I y I <« VTy * 1 1 

Clyfkc Arf Str »»0tF2-t«Jk polyiernt 

A<\ 
0-6 
C - 6 
C - 6 

U It/ 4 -17-5 kc«l 

HIV-I A 

c-c 
c-« 

6-C 
0- A 

. . C- 6 

66CCCT AA hJDOPCqA6 6U A«A&A0-6CAAU 
9 « 9 0 R F ► A tuPht 1 < a 6 I y A r | 

T«rf hePbTTrf 6 lo |>-pol OtF 



o 

0 - A 

o c 

6-C 
0- A 

Rsv j-| 

c-c 

6-C 

A-tt 
6-C 
6-C 

6-C 

CC6C B06 A C JAAAPOH a57| - 16AA66P A AA 

9 * g > Arq t « « T h VXTSTTJT? r 



NUC 24381 



WO 90/12488 

Figure 4 



PCIYUS9Q/0i717 



OH CO 

o<u. 
> >> 

• ooo 

• 0. 0. Q. 
•ZOO 

o o o 

• o oo 



0 CO CO 
CT u. U. 

j j 
o oo 

CLfluCU 

ooo 
oo 
ooo 



CO CO CO 

ooo 



>co Oj 
ooo 



2 

2 



3» 



aw 

O. -J 

• ooo 

•Sfc-crar 

CO HO 



"uT 

< 

a 
o 
o 



^O 0.0 o_ > 



0*< 
<< CO* 
O <>- 

Il.l4.-J 

ooo 



'uVTuVLC 

>-<H 

CO <* 



O.ZZ 

ooo 



202 
^ cc a: 
t-02 



o co 
^o" 

Ul Ul Z 

'cClCVT 

O O Ul 



22a 

022 

• • • 



a 
til 

o 

a: 

"S+ 







CO U. CO CO 


CO + 




>>> 


>~<< 






> -J 


— >> 




2 + 


O-f 


zoo 


-J > > > 




+ 


U5"cb 


AAA' 








loo 


ooo 


oooo 


Q 




OO 


ooo 


o o o o 


o 






CO CO 








Z> 


u.>> 


<o<< 








<<< 






+ 


•J >» 


<22 


«h22 


CO 




K> 






o 




COm 


V 10 ^ 


JJJL 


a 






CO -J -J 


*<XO 










• QT • * 


o 






HklO 


OOOO 


o 


+ 


2Z 


-Jiil o 


JU.JJ 


-J 


+ 


•J 


a o o 


o u. o o 


o 


+ 


• o 


uzx 




Lu 




•o 


Ul Ul Ul 


00.00 


> 




- 2* 




*co 


o 




• Ul 


> -1 -J 


>-*>->- 


•< 




• o 




<Ot-> 






• 1- 




OU. idM 


u: 


+ 




• a: ui 


Jh J J 


2 










>• 






• * a: 


-J CO -J -J 








• CLt- 


• u. • • 








•O-J 


• > • • 







CO CO CO 

ozz 

"cVVyVi^' 
"Cloio - 

O Ul UJ 



CO 

I- 



* -J 

1-2 



o a 
o o 



cm N 



CO id -J 

<oa 



h>-*-h 
co-ko: 



~ t-<* ~>_j_ 



■ la: gg 



p<o 



ggg 
lO tr» 



unsr 



CO o o 
Ui o* 

-J Ul< 

Si: 



c 
o 

o 





Ul Ul Ul 

ooo 

JJh 
U. Lu >- 

CM CO K 
O CD CO 



jjj 

Ul Ul Ui 

ooo 

o->> 
o>-x 




oo — 

22 J= 
U-UlfiC 



ftO o to 

o o> 

to 



*• o o 
o— « 

ex— x 



XO.O -J 



Ul 
O. 
CL 

to 



CM 

u. 
tc 
o 



"3 

!-.>> 
TOO. 

o o^> 

CO CO CO 

srore 

2KK 

300 

-©"fine" 

J* CO CO 



OOOO 
CO O. CO CO 
OCOOO 

0000 

CO CO CO CO 

2222 
OCOOO 

cycVo'cT 
uTCTSTaT 




* o ^00 

CM CO O CM O 

to 10 co *~<r- 

CM »0 IO 
U. 

— JZ OU- * 

00 u © o 

CUI A 



o o 

Ol~ 



TJZ « 

2i>- 



0*-l0 0 
OO NO 
O «-OlQ 
CM — CM 



©«•- o o 
c o — « 

— ft.— X 

X • o o 

eeXfluO 



CM 

u. 

O 



o 



C9 

o: 



NUC 24382 



WO 90/12488 

Figure 5 



pcr/us9e/onn 

6/11 



Frameshift Vector Based on 
the L-A Virus 



Yeast 

Promoter 



Frameshift Sequence 8eU-GaUcto&UU*« Yeast and Exofl repUcons 
AUGX X~. From L-A mRNA — tn-1 Frame I and selectable markers 




ranscription 



This stem- loop slows 
rtbosomal movement along 
the mRNA 



Translation start 
here sets the 
Initial frame 



u O 

A C 
G-U 

C-G 
A-U 

u-A 
c u 

UH3 



This sequence allows pepUdyfl 
and amlnoacy! tRNAa to sfip 
\ back one base on the mRNA 
\. (-1 frameshift event) 

mRNA-^AUGXXX 



~ ** Translation termlnaUon 
qJq signal In Frame V. 

y Host translations 



G-C 



GGGOOOAGGAGU-GGA' 



,t£a^. 



\1$q here. 



pnrtdn > Net— --rrame"Q" — ->Glyl^tiGlY 

The fnuneshf fled ^ 
protein sequence 
ahould be LeuArg 
Instead of LauGly 
for the unshtfted protein 



• • • 



. Beta-Galactosldase 



Reading continues m Frame M* 
In about 1% of translations 
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Yeast 
Promoter 



L^A Major Coat Protein gene- p24 gene 




mRMA 



Normal 
Monomers 



ormat 
Partlelat 



Transcription 




J, Translation 

Hybrid 

Monomers 6 Viral 
•6 *6 




sembty 



Replication Origin 
and selective mar kers 
for yeast and EL coll 



Particles Decorate 
with antigen 
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Natural L— A 
viral mRNA 




major coat protein 



Pacudotaot 



GGGTTTAGG - 1 ' """ 
i ii i 

I II I ►P°l OBP 



Frcimeshift Screening Vector 



PGK1 L-A slippery 

promoter AUG sequence 



Beta-galactosidose yeast 
in the —1 frame ori 



pBR322 



TRP1 



mRNA produced 

in vivo by fromeshift 

screening vector 



ATG- 



0 frame 



GGGTTTAGG 
i ii j 




Pseudoknot 



JL 



J- 



— la.cZ in -1 frame 
beta-galactosldase 



♦ 
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1.6 a of P« tequentes fro* t MlndUt site to II 

6ATC6TACT6 TTACTCTCTC TCTTTCAAAC IfiUmttt 6AATCCT6T6 
ACAACAACAfi CCT6TTCTCA CACACTCTTT TCTTCTAACC AAGGGGGT6G 
TUA6TTTA6 TA6AACCTC6 T6AAACTTAC AJTTACATAT ATATAAACTT 
GCATAAATT6 6TCAATGCAA GAAATA&ATA TTTGGTCTIT TCTAATTCST 
AGTTTTTCAA 6TTCTTAGAT GCTTTCTTTT tCTCTTTTTT ACAGATCATC 
AAGGAAGTAA TTATCIACTT TTTACAACAA AT || CTA 

*$Urt of transcript In this direction — » 

6AATTC ATACAAA AT6 ACT TCT Afifi 

|Tr«asUt(oa $Urt Slippery 5'Pseedokeot 

t-A base # SIU *t region *l 

1908 AAT6CS6fiflUACnAASAACTACT666^ 1QQ0 

K A 6 C I K R TV 6SYRITQQ6L6VV61 T H f A V N P 

CASTSAGFASSRSTDASCRAT 

3'Pseudoknot 

region ..... . . 

2001 A tBKAGARCCIACA6CT6CC6CT<iC^^ 2100 

♦! 41 ♦! 0 0 

TGCFfA6AAMCCl!CQA*IVLVt* 

RRTYSVRCFRRV0RI6GQCF$RVlV!tPSH6FR 

» 

2101 COTACAA66TACATACTGCA6 CCC 
P T t T 1 I Q 

pFI A pF9 hm 

1 & 2 extra 6*t Here, respectively 
I 

60S CTA CC 
S**l Kpnl-AspllS 

6AT CCC 6TC 6TT TTA CAA CGT CGT6ACTG6 6AAAACGCT6 GCSTTACOCA 

Asp Pro Vol Vol Loo UcZ to the *1 freee to pf7 

. In the 0 free* to pf? 
In the -1 fraae fo pf* 

ACTTAATCGC CTTGCA6CAC ATCCCCCTTT CGCCAGCTS6 C6TAATAG06 
AA6AS60CC6 CACC6ATCGC 0CTT0CCAAC ASTTGCBCAfi CCTSAATGGC 
6AATGGCSCT TTGCCTGSTT TCCCGCACCA 6AA60G6T6C CS6AAA 



% 
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Plagttjd 5D1B/3P30X8: 

1.6 KB of PGK sequences fron a KindXIX site. to || 

GATCGTACTG TTACTCTCTC TCTTTCAAAC AGAATTGTCC GAATCGTGTG 

ACAACAACAG CCTGTTCTCA CACACTCTTT TCTTCTAACC AAGGGGGTGG 



TTTAGTTTAG TAGAACCTCG TGAAACTTAC ATTTACATAT ATATAAACTT 
GCATAAATTG GTCAATGCAA GAAATAGATA TTTGGTCTTT TCTAATTCGT 
AGTTTTTCAA GTTCTTAGAT GCTTTCTTTT TCTCTTTTTT ACAGATCATC 
AAGG AAGTAA TTATCTACTT TTTACAACAA AT | j CPA 

A Start of transcript in this direction — > 



TGEPTAGAA 
RRTYSWRCP 

TAG CTCGAGCCCGGG GTA CC 

Xhoi snal KpnI=Asp7ia 

GAT CCC GTC GTT TTA GAA CGT CGTGACTGG GAAAACCCTG GCGTTACCCA 

Asp Pro Val Val Lea LacZ in the -1 frame in 5D18/3D30XS 

ACTTAATCGC CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG CGTAATAGCG 
AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGOGCAG CCTGAATGGC 
GAATGGCGCT TTGCCTGGTT TCCGGCACCA GAAGOGGTGC CGGAAA 



GAATTC ATACAAA ATG ACT TCT AGG 
EcoRl I AA 

(Translation Start 




QG LGVVG It , T K P A V M P 
GFRSG RSYDASCNAY 

3'Pseudoknot 
region 

2001 ACCGGAGAACCTACAGCTGGCGCTGCC 2030 



2000 



v 



r 
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